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Gaucher's disease (GD) is an autosomal recessive disorder caused by the deficiency of 
glucocerebrosidase, a lysosomal enzyme that catalyses the hydrolysis of the glycolipid 
glucocerebroside to ceramide and glucose. Polymorphisms in GBA gene have been 
associated with the development of Gaucher disease. We hypothesize that prediction of 
SNPs using multiple state of the art software tools will help in increasing the confidence 
in identification of SNPs involved in GD. Enzyme replacement therapy is the only option 
for GD. Our goal is to use several state of art SNP algorithms to predict/address harmful 
SNPs using comparative studies. In this study seven different algorithms (SIFT MutPred, 
nsSNP Analyzer, PANTHER, PMUT, PROVEAN, and SNPs&GO) were used to predict 
the harmful polymorphisms. Among the seven programs, SIFT found 47 nsSNPs as 
deleterious, MutPred found 46 nsSNPs as harmful. nsSNP Analyzer program found 43 
out of 47 nsSNPs are disease causing SNPs whereas PANTHER found 32 out of 47 as 
highly deleterious, 22 out of 47 are classified as pathological mutations by PMUT, 44 out 
of 47 were predicted to be deleterious by PROVEAN server, all 47 shows the disease 
related mutations by SNPs&GO. Twenty two nsSNPs were commonly predicted by all 
the seven different algorithms. The common 22 targeted mutations are F251L, C342G, 
W312C, P415R, R463C, D127V, A309V, G46E, G202E, P391L, Y363C, Y205C, W378C, 
I402T S366R, F397S, Y418C, P401L, G195E, W184R, R48W, and T43R. 

Keywords: glucocerebrosidase, SIFT, MutPred, PANTHER, PMUT, PROVEAN, SNPs&GO 



INTRODUCTION 

Gaucher's disease (GD) is a rare genetic disease in which fatty 
substances accumulate in cells and certain organs (James et al., 
2006). It is a common lysosomal storage disorder and results 
from an inborn deficiency of the enzyme glucocerebrosidase 
(also known as acid P-glucosidase). This enzyme is responsible 
for glucocerebroside (glucosylceramide) degradation. The accu- 
mulation of undegraded substrate generally happens because of 
enzyme deficiency, mainly within cells of the macrophage lineage 
or monocyte, and it is responsible for the clinical manifestations 
of the disease (Beutler and Grabowski, 2001). This glucosylce- 
ramide degrading enzyme is encoded by a gene named GBA, 
which is 7.6 kb in length and located in lq21 locus. Recessive 
mutation in GBA gene affects both males and females (Horowitz 
et al, 1989; Zimran et al, 1991; Winfield et al, 1997). GBA 
protein is 497 amino acids long with the molecular weight of 
55.6 KD. GBA enzyme catalyses the breakdown of glucosylce- 
ramide, a cell membrane constituent of white blood cells and 
red blood cells. The macrophages fail to eliminate the waste 
product and results in accumulation of lipids in fibrils and this 
turn into Gaucher cells (Aharon et al., 2004). GD can be clas- 
sified into three classes namely types 1, 2, and 3. In type 1, 
Glycosylceramide accumulate in visceral organs whereas in type 
2 and 3, the accumulation is in the central nervous system 
(Grabowski, 2008). 



The international disease frequency of GD is 200,000 except 
for areas of the world with large Ashkenazi Jewish populations 
where 60% of the patients are estimated to be homozygous, which 
accounts for 75% of disease alleles (Pilar et al, 2012). Almost 
300 unique mutations have been reported in the GBA gene, with 
distribution that spans the entire gene. These include 203 mis- 
sense mutations, 18 nonsense mutations, 36 small insertions or 
deletions that lead to frameshift or in-frame alterations, 14 splice 
junction mutations and 13 complex alleles carrying two or more 
mutations (Hruska et al., 2008). The single nucleotide variations 
in the genome that occur at a frequency of more than 1% are 
referred as single nucleotide polymorphisms (SNPs) and in the 
human genome, SNPs occur in just about every 3000 base pairs 
(Cargill et al, 1999). 

Nearly 200 mutations in the GBA gene have been described 
in patients with GD types 1, 2, and 3 (Jmoudiak and Futerman, 
2005). L444P mutation was identified in GBA gene in patients 
with GD types 1, 2, and 3. The L444P substitution is one of the 
major SNP associated with the GBA gene. D409H, A456P, and 
V460V mutations were also identified in patients with GD (Tsuji 
et al., 1987; Latham et al., 1990). Previous findings have shown 
that, in 60 patients with types 1 and 3, the most common Gaucher 
mutations identified were N370S, L444P, and R463C. (Sidransky 
et al, 1994). The other mutation E326K had been identified in 
patients with all three types of GD, but in each instance it was 
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found on the same allele with another GBA mutation. Also, Park 
et al. identified the E326K allele in 1.3% of patients with GD and 
in 0.9% of controls, indicating that it is a polymorphism (Park 
etal, 2002). 

The harmful SNPs for the GBA gene have not been pre- 
dicted to date in silico. Therefore we designed a strategy for 
analyzing the entire GBA coding region. Different algorithms 
such as SIFT (Ng and Henikoff, 2001), MutPred (Li et al, 
2009), nsSNP Analyzer (Bao et al, 2005), PANTHER (Mi 
et al, 2012), PMUT (Costa et al, 2002), PROVEAN (Choi 
et al, 2012), and SNPs&GO (Calabrese et al, 2009) were uti- 
lized to predict high-risk nonsynonymous single nucleotide 
polymorphisms (nsSNPs) in coding regions that are likely 
to have an effect on the function and structure of the 
protein. 

MATERIALS AND METHODS 
DATA SET 

SNPs associated with GBA gene were retrieved from the single 
nucleotide polymorphism database (dbSNP) (http://www.ncbi. 
nlm.nih.gov/snp/), and are commonly referred by their reference 
sequence IDs (rsID) (Wheeler et al., 2005). 

VALIDATION OF TOLERATED AND DELETERIOUS SNPs 

The type of genetic mutation that causes a single amino acid sub- 
stitution (AAS) in a protein sequence is called nsSNP. An nsSNP 
could potentially influence the function of the protein, subse- 
quently altering the phenotype of carrier. This protocol describes 
the use of the Sorting Intolerant From Tolerant (SIFT) algorithm 
(http://sift.jcvi.org) for predicting whether an AAS affects protein 
function. To assess the effect of a substitution, SIFT assumes that 
important positions in a protein sequence have been conserved 
throughout evolution and therefore at these positions substi- 
tutions may affect protein function. Thus, by using sequence 
homology, SIFT predicts the effects of all possible substitutions 
at each position in the protein sequence. The protocol typically 
takes 5-20 min, depending on the input (Kumar et al., 2009). 

PREDICTION OF HARMFUL MUTATIONS 

MutPred (http://mutdb.org/mutpred) models structural features 
and functional sites changes between mutant sequences and wild- 
type sequence. These changes are expressed as probabilities of 
gain or loss of structure and function. The MutPred output con- 
tains a general score (g), i.e., the probability that the AAS is 
deleterious/disease-associated and top five property scores (p), 
where p is the P-value that certain structural and functional 
properties are impacted. Certain combinations of high values of 
general scores and low values of property scores are referred to as 
hypotheses (Li et al, 2009). 

IDENTIFYING DISEASE-ASSOCIATED nsSNPs 

nsSNP Analyzer (http://snpanalyzer.uthsc.edu) is a tool to predict 
whether a nsSNP has a phenotypic effect (disease-associated vs. 
neutral) using a machine learning method called Random Forest, 
and extracting structural and evolutionary information from a 
query nsSNP (Bao et al., 2005). 



PREDICTION OF DELETERIOUS nsSNPs 

PANTHER (http://pantherdb.org/tools/csnpScoreForm.jsp) esti- 
mates the likelihood of a particular nsSNP to cause a functional 
impact on a protein (Thomas et al, 2003). It calculates the sub- 
stitution position-specific evolutionary conservation (subPSEC) 
score based on the alignment of evolutionarily related proteins. 
The subPSEC score is the negative logarithm of the probability 
ratio of the wild-type and the mutant amino acids at a particu- 
lar position. The subPSEC scores are values from 0 (neutral) to 
about — 10 (most likely to be deleterious). 

PREDICTION OF PATHOLOGICAL MUTATIONS ON PROTEINS 

PMUT (http://mmb2.pcb. ub.es:8080/PMut) uses a robust 
methodology to predict disease-associated mutations. PMUT 
method is based on the use of neural networks (NNs) trained 
with a large database of neutral mutations (NEMUs) and patho- 
logical mutations of mutational hot spots, which are obtained by 
alanine scanning, massive mutation, and genetically accessible 
mutations. The final output is displayed as a pathogenicity index 
ranging from 0 to 1 (indexes > 0.5 single pathological mutations) 
and a confidence index ranging from 0 (low) to 9 (high) (Costa 
et al., 2005). 

PREDICTING THE FUNCTIONAL EFFECT OF AMINO ACID 
SUBSTITUTIONS 

PROVEAN (Protein Variation Effect Analyzer) (http://provean. 
jcvi.org) is a sequence based predictor that estimates the effect of 
protein sequence variation on protein function (Choi et al., 2012). 
It is based on a clustering method where BLAST hits with more 
than 75% global sequence identity are clustered together and top 
30 such clusters from a supporting sequence are averaged within 
and across clusters to generate the final PROVEAN score. A pro- 
tein variant is predicted to be "deleterious" if the final score is 
below a certain threshold (default is —2.5), and is predicted to be 
"neutral" if the score is above the threshold. 

PREDICTION OF DISEASE RELATED MUTATIONS 

The SNPs&GO algorithms (http://snps-and-go.biocomp.unibo. 
it/snps-and-go/) predict the impact of protein variations using 
functional information encoded by Gene Ontology (GO) terms 
of the three main roots: Molecular function, Biological process, 
and Cellular component (Calabrese et al., 2009). SNPs&GO is 
a support vector machine (SVM) based web server to predict 
disease related mutations from the protein sequence, scoring 
with accuracy of 82% and Matthews correlation coefficient equal 
to 0.63. SNPs&GO collects, in a unique framework, informa- 
tion derived from protein sequence, protein sequence profile and 
protein functions. 

RESULTS 

nsSNPs FOUND BY SIFT PROGRAM 

Protein sequence with mutational position and amino acid 
residue variants associated with 97 missense nsSNPs were sub- 
mitted as input to the SIFT server, and the results are shown in 
Table 1. The lower the tolerance index, the higher the functional 
impact a particular amino acid residue substitution is likely to 
have and vice versa. Among the 97 nsSNPs analyzed, 47 nsSNPs 
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Table 1 | Tolerated and deleterious nsSNPs using SIFT. 



S. No 


rsID 


Alleles 


Position 


AA change 


Prediction 


Score 


1 


rs IZ lyUoo 14 


1 A / 

L/V 


371 


Leu/vai 


Damaging 


0.04 




1 0 1 QAQ0 1 Q 


r/L 


ZOl 


Dl^n /I Al ■ 

rne/Leu 


Damaging 


U.U1 


3 


rs iz lyuoo iz 


1/ /M 


7Q 

/y 


Lys/Asn 


Tolerated 


0.52 


A 


1 s 1 z 1 yuoo 1 1 




o/ / 


oiy/oer 


Damaging 


n no 
u.uz 


C 

D 


rs IZ lyuoo lu 


v/i- 


OQQ 


vai/rne 


Damaging 


n ni 
U.U 1 


6 


rs 1 z 1 yuooUo 


n/o 


353 


A 1-^ /r* ii 1 
Arg/bly 


Tolerated 


0.38 


-1 

1 


rs IZ lyuooU/ 


c rr 
b/ 1 


364 


Ser/Thr 


Tolerated 


0.1 2 


o 
8 


rsizi^UooUb 


U/la 


342 


uys/(jiy 


Damaging 


0.01 


9 


rslZiyUoJUb 


G/R 


325 


Gly/Arg 


Tolerated 


0.44 


10 


rs1 21 908304 


W/C 


312 


Trp/Cys 


Damaging 


0.00 


11 


rsiziyuoJUJ 


FA/ 


216 


PheA/al 


Damaging 


0.00 




rs 1 z 1 yuoouz 


V/L 


10 


\/-:,l /I ,-,11 

vai/Leu 


Tolerated 


n n7 
U.U/ 


13 


rs 1 z 1 yuooU i 


^ /c 
b/b 


478 


^ ii i/c^i- 
bly/ber 


Tolerated 


0. 17 


14 


rs 1 z 1 yuoouu 


V/l_l 

Y/H 


212 


lyr/His 


Damaging 


0.03 


10 


rs 1 z 1 yucszyy 


P/Q 


100 
IZZ 


rro/oer 


Tolerated 


n o7 
u.o / 


lu 


rs IZ lyubzyb 


D/l 

r/L 


OQQ 

zoy 


Pro/Leu 


Tolerated 


n /i Q 
U.4o 


17 


rs IZ lyuozy/ 


IsyU 


157 


1 1 ir- /r* 1 n 

Lys/bin 


Tolerated 


0.06 


1o 




D /D 

r/n 




Pro/Arg 


Damaging 


U.UU 




rSoUoob/ / o 


D /Ul 

n/n 


4yD 


Arg/nis 


Tolerated 


n 1 Q 

u. 1 y 


on 


rsoUoou/ /z 


n/n 


/IRQ 
400 


Arg/nis 


Tolerated 


n nK 

U.UD 


Z1 


rsoUobb//1 




463 


Arg/Cys 


Damaging 


0.02 


22 


rSoUooD /by 


\ / /I 
V/L 


394 


\ /i 1 /I 
vai/Leu 


Damaging 


0.03 


Zo 


rSoUooD /Do 


A/T 
A/ 1 


OOO 


AU/Thr 

Aia/ 1 nr 


Tolerated 


n OQ 
u.oy 




rsesUoDD/Do 


R/l 
n/L 


101 


Arg/LGU 


Tolerated 


n 0/1 

U.Z4 


ZO 


rSoUZUoU4D 


D/l 
r/L 


1 QO 

loZ 


Pro/Leu 


Damaging 


n nn 
U.UU 


26 


rSoU 1 1 DDOO 


b/U 


265 


biy/Asp 


Damaging 


0.00 


0"7 
Z/ 


rSoUUZUcSUo 


K/l /I 

IVI/I 


4 lo 


K/l,-,+ /l l,-i 
IVIGT/IIG 


Tolerated 


n /I o 
U.4Z 


28 


rs/yy4o/4 1 


r/L 


213 


DU ^ /I 

rne/Leu 


Tolerated 


0.18 


29 


rs/s/sbUbl 


U/ V 


127 


Asp/ vai 


Damaging 


0.00 


30 


rs/ybyboo i 


n/H 


285 


Arg/His 


Damaging 


0.00 


31 


rs/ybb3/y / 


R/Q 


120 


Arg/bIn 


Damaging 


0.00 


oo 
oZ 


rc"70CQ"7d "7 

rs/yoo /o 1 / 


D/l 

r/L 


1 OO 

IZZ 


Pro/Leu 


Damaging 


n no 
U.UZ 


OO 


rc"70o 1 Koon 
rs/yz 1 bzzu 


D/R 
K/h 


ZoO 


rro/Arg 


Damaging 


n nn 
U.UU 


34 


rs/y 1 obo /U 


C/l 

r/L 


41 / 


DU^ /I 

rrie/Leu 


Damaging 


0.01 


OO 


rs /yy/o lUo 


R IC\ 
n/Li 


OK7 

ZO/ 


Arg/bin 


Tolerated 


n nc 
U.UO 


36 


rs/yy 1 1 Z4b 


^ A/ 

b/V 


189 


k y A /-,l 

biy/vai 


Damaging 


0.02 


0"7 

o/ 


rs/ooUZU4y 


U/b 


409 


Asp/blU 


Tolerated 


0.32 


38 


1- "7 O "7C n "7 "7/1 

rs/o/oy / /4 


D /(^ 

n/U 


4o 


Arg/bin 


Tolerated 


0.06 


OQ 
OC7 


ro7D71 QQ 

rs / £5 / 1 0 1 yy 


u/t 


OCSU 


ASp/blU 


Damaging 


n nn 

U.UU 




rs/oosbbbU 


A /I/ 

A/ V 


309 


Ala/ vai 


Damaging 


0.00 


/1 1 
4 1 


rs/o 1 yoZo4 


Ul /R 

n/n 


011 
o 1 1 


Ulic/A m 

nis/Arg 


Damaging 


n nn 
U.UU 


yio 
4Z 


rs/o 1 ooZUb 


A /r\ 
A/U 


318 


Ala/Asp 


Tolerated 


0.63 


4o 


ro77QRQQ7K 

rs/ /yoyy/D 


^/l/l 
IVI/I 


1 OO 
IZo 


N/1q+/I Iq 

iviei/i le 


Tolerated 


1 nn 

I.UU 




rc77QQ/1 7/1 7 
rs / /oo4/4/ 


1 /c 
l/b 


1 ly 


1 1 (-1 /C ,-1 r 

iie/oer 


Tolerated 


n o/i 
U.o4 


45 


rs77829017 


G/E 


46 


Gly/Glu 


Damaging 


0.01 


46 


r ^ ~7 ~7 "7 T O 0 1 

rs/ //joboZ 


N/l 


392 


Asn/lle 


Damaging 


0.00 


47 


rs77451368 


G/E 


202 


Gly/Glu 


Damaging 


0.02 


48 


rs77369218 


D/V 


409 


Asp/Val 


Tolerated 


0.06 


49 


rs77321207 


Y/C 


395 


Tyr/Cys 


Damaging 


0.00 


50 


rs77284004 


D/A 


380 


Asp/Ala 


Damaging 


0.00 


51 


rs77035024 


F/L 


411 


Phe/Leu 


Tolerated 


0.30 


52 


rs77019233 


N/K 


117 


Asn/Lys 


Tolerated 


0.21 


53 


rs76910485 


P/L 


391 


Pro/Leu 


Damaging 


0.00 



(Continued) 
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Table 1 | Continued 



S. No 


rsID 


Alleles 


Position 


AA change 


Prediction 


Score 


54 


rs/D/bo/ Id 


M /C 


370 


Asn/Ser 


Damaging 


0.05 


55 


rs/o/bo/ Id 


WIT 

IM/ 1 


370 


Asn/ 1 nr 


Damaging 


0.04 


56 


rs/booyo 14 


T/i 

I/I 


323 


1 nr/lie 


Tolerated 


0.48 




rs/D^Zoiz^ 


Y/U 




lyr/uys 


Damaging 


U.UU 


CO 
DO 


rs/bUzblU^ 


Y/0 


205 


Tyr/Cys 


Damaging 


0.00 




■S/DU 143 l9 




0/0 


Tip/Cys 


Damaging 


U.UU 


60 


rs/byD49Ub 


F/L 


37 


P he/Leu 


Tolerated 


0.30 




rs/bb/ lUzy 


U/N 


44o 


Asp/Asn 


Tolerated 


0.93 


62 


rs/bbJb/by 


A/E 


190 


Ala/GIu 


lolerated 


1.00 


63 


rs75564605 


l/T 


402 


lle/Thr 


Damaging 


0.04 


64 


rs/bb4o4U1 


T/M 


369 


1 hr/lVlet 


lolerated 


0.08 


cc 
DO 


rs/3b^o434 


C /D 


occ 
ODD 


oer/Ary 


Damaging 


n no 


66 


rs/b3obobo 


N/T 


396 


Asn/Thr 


Damaging 


0.00 


D/ 


rs/3Z4oUUU 


c /c 
r/o 


OS/ 


rne/oer 


Damaging 


n no 


DO 


rs/buyuyucs 


U/L 


QQQ 


A c~ 1-1 1 1 1 
ASp/ulU 


Tolerated 


n 1"7 
U. 1 / 


CO 
DO 


rs/4y /y4ob 


n/Lz 


ooa 


Arg/uin 


Tolerated 


U.Ub 


/U 


rS /4yD oDDcj 


u/t 


Z4 


A 01-1 /r; 1 1 1 
Asp/uiu 


Damaging 


u.u 1 


71 


rs/4/DZo/o 


v /r* 
Y/0 


418 


Tyr/Cys 


Damaging 


0.00 


72 


rs /4 / 0 1 o4U 


C /M 


271 


Ser/Asn 


Tolerated 


0.26 


73 


rs/4b9olob 


D /I 

r/L 


401 


Pro/Leu 


Damaging 


0.00 


"7/1 


rs/4buuzbb 


|-/Y 


TIC 
Z ID 


rriG/ lyr 


Tolerated 


U.o4 


75 


rS/44bZ/4o 


o/b 


195 


uly/olu 


Damaging 


0.00 


/O 


■ Sb 1 /4o3Ub 


\A//R 

vv/rs 


104 


Trp/ Arg 


Damaging 


n nn 

U.UU 


-7-7 


rs 1 1 bbo 1 o4 


D ir\ 
n/U 


000 


Arg/uin 


Tolerated 


u.by 


78 


rszzoUzoo 


C /I/ 


326 


blu/Lys 


Tolerated 


0.86 


79 


rs 1 14 1 ozU 


1-1 /D 


60 


Mis/Arg 


Tolerated 


0.54 


80 


-1-1 >1 1 0 -1 Q 


HA' 


60 


His/Tyr 


Tolerated 


0.09 


81 


rs 1 1 4 1 0 1 b 


Ivl/ 1 


53 


iviet/ 1 nr 


Tolerated 


0.59 


82 


rsn4i8i4 


R/W 


48 


Arg/Trp 


Damaging 


0.00 


83 


rs 1 14 1 0 iz 


n/b 


44 


Arg/ber 


Tolerated 


0.14 


84 


-1-1 n '\Ci-\-\ 

rsn41o11 


T/I 


43 


Thr/lle 


Damaging 


0.01 


80 


•I'l >| '1 0'l -1 


1 /K 


4o 


1 nr/Arg 


Damaging 


0.02 


86 


rs 1 14 1 oUo 


C /I/ 
t/N 


41 


blu/Lys 


Tolerated 


0.52 


0"7 


rs 1 14 1 oU4 


o/b 


lb 


ber/biy 


Tolerated 


1.00 


88 


rs 1 14 1 oUz 


1 /c 
L/o 


lb 


Leu/Ser 


Tolerated 


0.63 


89 


rs l(Jb4bb 1 


U/H 


4uy 


Asp/His 


Tolerated 


0.05 


90 


rs IUb4b4o 


D /U 

n/H 


329 


Arg/nis 


Tolerated 


0.17 


91 


rs 1064644 


S/P 


196 


Ser/Pro 


Tolerated 


0.17 


92 


rs421016 


UP 


444 


Leu/Pro 


Damaging 


0.00 


93 


rs381737 


F/l 


213 


Phe/lle 


Tolerated 


0.18 


94 


rs381427 


V/E 


191 


Val/Glu 


Tolerated 


0.16 


95 


rs381427 


V/G 


191 


Val/Gly 


Tolerated 


0.16 


96 


rs368060 


A/P 


456 


Ala/Pro 


Tolerated 


0.09 


97 


rs364897 


N/S 


188 


Asn/Ser 


Tolerated 


0.17 



The consensus SNPs are shown in bold. 



were identified to be deleterious with a tolerance index score 
<0.05 (Kumar et al, 2009). Among 47 deleterious nsSNPs, 25 
nsSNPs were found to be highly deleterious. 

VALIDATION OF HARMFUL MUTATIONS 

The MutPred score is the probability that an AAS is 
deleterious/disease-associated. A missense mutation with a 



MutPred score >0.5 could be considered as "harmful," while a 
MutPred score >0.75 should be considered a high confidence 
"harmful" prediction (Li et al., 2009). Among the 47 deleterious 
nsSNPs, 8 were found to be harmful mutations with a score 
of >0.5 and <0.75 and 38 were found to be high confidence 
(highly harmful) mutations and 1 nsSNP found to be normal 
with the score of 0.193 (Table 2). 
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Table 2 | Prediction of functional effects of nsSNPs using MutPred. 



S. No 


rsID 


Alleles 


Position 


AA change 


MutPred prediction 


Score 


1 


rs121908314 


L/V 


371 


Leu/val 


High confidence 


0.824 


2 


rs121908313 


F/L 


251 


Phe/Leu 


High confidence 


0.778 


3 


rs121908311 


G/S 


377 


Gly/Ser 


Neutral 


0.193 


4 


rs121908310 


V/F 


298 


Val/Phe 


High confidence 


0.765 


5 


rs121908306 


C/G 


342 


Cys/Gly 


High confidence 


0.792 


6 


rs121908304 


W/C 


312 


Trp/Cys 


Harmful mutation 


0.735 


7 


rsl 21 908303 


F/V 


216 


Phe/Val 


High confidence 


0.879 


8 


rsl 21 908300 


Y/H 


212 


Tyr/His 


High confidence 


0.82 


9 


rs121908295 


P/R 


415 


Pro/Arg 


High confidence 


0.914 


10 


rs80356771 


R/C 


463 


Arg/Cys 


Harmful mutation 


0.664 


11 


rs80356769 


V/L 


394 


Val/Leu 


High confidence 


0.794 


12 


rs80205046 


P/L 


182 


Pro/Leu 


High confidence 


0.892 


13 


rs80116658 


G/D 


265 


Gly/Asp 


High confidence 


0.963 


14 


rs79796081 


DA/ 


127 


Asp/Val 


High confidence 


0.754 


15 


rs79696831 


R/H 


285 


Arg/His 


High confidence 


0.884 


16 


rs79653797 


R/Q 


120 


Arg/GIn 


High confidence 


0.902 


17 


rs79637617 


P/L 


122 


Pro/Leu 


High confidence 


0.835 


18 


rs79215220 


P/R 


166 


Pro/Arg 


High confidence 


0.836 


19 


rs791 85870 


F/L 


417 


Phe/Leu 


High confidence 


0.905 


20 


rs78911246 


G/V 


189 


Gly/Val 


Harmful mutation 


0.713 


21 


rs78715199 


D/E 


380 


Asp/Glu 


High confidence 


0.837 


22 


rs78396650 


A/V 


309 


Ala/Val 


High confidence 


0.776 


23 


rs781 98234 


H/R 


311 


His/Arg 


High confidence 


0.873 


24 


rs77829017 


G/E 


46 


Gly/Glu 


High confidence 


0.856 


25 


rs77738682 


N/l 


392 


Asn/lle 


High confidence 


0.814 


26 


rs77451368 


G/E 


202 


Gly/Glu 


Harmful mutation 


0.676 


27 


rs77321207 


Y/C 


304 


Tyr/Cys 


High confidence 


0.909 


28 


rs77284004 


D/A 


380 


Asp/Ala 


High confidence 


0.872 


29 


rs76910485 


P/L 


391 


Pro/Leu 


High confidence 


0.889 


30 


rs76763715 


N/S 


370 


Ans/Ser 


High confidence 


0.876 


31 


rs76763715 


N/r 


370 


Asn/Thr 


High confidence 


0.89 


32 


rs76228122 


Y/C 


363 


Tyr/Cys 


High confidence 


0.93 


33 


rs76026102 


Y/C 


205 


Tyr/Cys 


High confidence 


0.857 


34 


rs76014919 


W/C 


378 


Trp/Cys 


High confidence 


0.842 


35 


rs75564605 


l/T 


402 


lleThr 


High confidence 


0.838 


36 


rs75528494 


S/R 


366 


Ser/Arg 


Harmful mutation 


0.681 


37 


rs75385858 


N/T 


396 


Asn/Thr 


High confidence 


0.848 


38 


rs75243000 


F/S 


397 


Phe/Ser 


Harmful mutation 


0.724 


39 


rs74953658 


D/E 


24 


Asp/Glu 


High confidence 


0.818 


40 


rs74752878 


Y/C 


418 


Tyr/Cys 


High confidence 


0.872 


41 


rs745981 36 


P/L 


401 


Pro/Leu 


High confidence 


0.888 


42 


rs74462743 


G/E 


195 


Gly/Glu 


High confidence 


0.859 


43 


rs61748906 


W/R 


184 


Trp/Arg 


High confidence 


0.902 


44 


rs1141814 


R/W 


48 


Arg/Trp 


High confidence 


0.804 


45 


rs1141811 


T/l 


43 


Thr/lle 


Harmful mutation 


0.504 


46 


rs1141811 


T/R 


43 


Thr/Arg 


Harmful mutation 


0.579 


47 


rs421016 


L/P 


444 


Leu/Pro 


High confidence 


0.899 


The consensus SNPs are shown in bold. 












DISEASE-ASSOCIATED nsSNPs 






VALIDATION BY PANTHER 






Out of 47 


deleterious nsSNPs, 


43 were found to 


be a dis- 


The protein sequence was 


given as input and analyzed for the 


ease causing 


I nsSNPs and 4 were 


found to be neutral nsSNPs 


deleterious effect on protein function. The subPSEC 


scores are 


(Tables). 








values from 0 (neutral) to about —10 (deleterious) (Thomas 
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Table 3 | The results from nsSNP Analyzer, PMUT PROVEAN, and SNPs&GO. 



S. No 


rsID 


Allele 


Position 


AA change 


nsSIMP 
Analyzer 


PMUT 


PROVEAN 
Score Prediction 


SNPs&GO 


1 


1 O 1 1 C7UOO 1 H- 


1 A/ 


371 


i oi 1 A/a 1 


Mpi itral 

1 NCU LI a 1 


KIpi itral 
1 N cLJ ll a 1 


—2.331 


Neutral 


P Fcpa CP 
L-/ locaoc 


2 




F/L 


£3 1 


Pho/I oil 


L^l3t?a3C 


Pat rm 1/^^ 1 f^a 1 


—4.567 


Deleterious 


l^ic Aac A 


3 


rolOiqnQQII 


G/S 


377 


Glv/Spr 


riicpacp 

\-J 1 oCOOO 


Mpi itral 

F N C LJ IL a 1 


—5.128 


Deleterious 


P icpQcp 


4 


1 O 1 ^ 1 w UO 1 \J 


V/F 

V/ 1 


398 


Val/Php 

V G 1/ 1 lie? 


riicpacp 
\-J 1 oCaoC 


Mpi itral 
1 N c LJ ll a 1 


—4.185 


Deleterious 


Picpacp 


5 




C/G 


342 


pwe/filw 
l^ya/ Uiy 


Dicoaco 

L^l3t?a3C 


Pa t h n 1 1 a 1 
I a Ll luiuy iuai 


—11.467 


Deleterious 


Die pacp 
Lyiat?d3t? 


g 


rQl71Qnfi')nd 

13 l£ l9\/00\/*T 


W/C 


312 




Dicoaco 
L^isCasc 


Pa t h /\ 1 n n 1 a 1 
laLiiv/iuyiiiCii 


-12.258 


Deleterious 


Dicpacp 
iiyiaC?d3t? 


7 


[ o 1 Z. 1 CKUOOiJO 


FA/ 
r/ V 


216 


r 1 1 c/ V a 1 


Pi icoaco 


KIpi itral 
1 N cLJ ll a 1 


—7 


Deleterious 


P Fcpacp 


8 


rqlTiqnQ'SQQ 


Y/H 


212 


Tyr/His 


PlicpQcp 
l_> 1 oCaoC 


Mpi itral 

F N LJ 1 1 a 1 


—4.267 


Deleterious 


P icpacp 


g 


rQl71Qnfi7Qi^ 

13 l£ l9\/O^S>J 


P/R 


415 


Pro/ArQ 


Dicoaco 

L^l3C?a3C 


Pa t h n 1 1 a 1 
laLiiv/iuyiiiCii 


—8.793 


Deleterious 


Dicpacp 

Lyi3C?d3t? 


10 


r«i8n'^*ifi771 

ISOUOiJU/ / 1 


R/C 


463 


Am / f*\/c 


Dicoaco 

L^l3t?a3C 


Pa t h n 1 n n 1 o a 1 
laLiiuiuyiiiCii 


—5.279 


Deleterious 


Dicpacp 
Lyiac?d3t? 


11 


1 OOUOvJU /\JU 


V/l 


394 


\/al/l pii 

Va 1/ i—tJU 


Mpi itral 
1 >ic;u LI a 1 


Mpi itral 
1 N cLJ ll al 


-2.031 


Neutral 


P Fcpa CP 
L-/ locaoc 


12 






182 


Prn/I PI 1 


plicpacp 


Mpi itral 
1 N ^ LJ ll a 1 


-9.917 


Deleterious 


Picpacp 


13 


rqSnilfiBRR 


G/D 


265 




Plicpacp 


Mpi itral 

1 N C LJ IF a F 


—6.442 


Deleterious 


P icpacp 


14 


rQ7Q7QRnfi1 

13/9/ 9U\/U 1 


D/V 


127 


n3|J/ Val 


Dicoaco 

L^l3t?a3C 


Pat hnlnnif^al 
laLiiuiuyiuai 


-8.625 


Deleterious 


Dicoacp 
■iyiaC?d3t? 


1 o 


rQ7QRQRpQ1 


R/H 


285 


Arn/Hic; 
Ml y/ n lb 


U Ibcdbc 


Kloi itra 1 
INfcJU If d I 


—4.792 


Deleterious 


\\ 1 CO a c o 

U Ibcdbc 


16 


1 O / \J\J\J\J t ^ 1 


R/Q 


120 


Arn/nin 

1 y / v_i 1 1 1 


P) jcpacp 


Mpi itral 

F N LJ IF a F 


-3.641 


Deleterious 


P icpQcp 


17 


[ o / C/UO / U 1 / 


P/1 


122 


Prn/I oi 1 
r 1 u/ i_cu 


P icoaco 


KIpi itral 
1 N cLJ ll a 1 


—9.265 


Deleterious 


P Fcpacp 


18 


1 O / -ZJ iL. \ i_J Z_ W 


P/R 


266 


Prn/Arn 

i 1 LJ/ rAI y 


PicpQcp 
\—/ 1 oCaoC 


Mpi itral 

1 N C LJ IF a F 


—8.275 


Deleterious 


P icpacp 


1Q 


1 b / 3 1 OOO / U 


F/l 
r/L 


417 


Php/I PI 1 
r 1 1 c/ l_c U 


ll 1 c oa c o 
L-J Ibfcfdbc 


Kl OF Ftra 1 
1 N cU U d 1 


—5.095 


Deleterious 


\\ 1 CO a c o 

LJ Ibcdbc 


20 


1 o / OZ3 1 1 Z.H-U 


G/V 


189 


\j 1 y/ val 


P icoacp 


KIpi itral 

1 NCLJ ll a 1 


-6.4 


Deleterious 


P Fcoacp 
L^ locdoc 


21 




D/F 


380 


Aqn/niii 

/^O LJ/ \J 1 VJ 


KIpi itral 

J ^ C7 Li Ll a 1 


Mpi itral 

1 N LJ IF a F 


—3.797 


Deleterious 


P icpQcp 


22 


13/ 0^3\J\jJ\l 


A/V 


309 


Aia/Vai 

/^la/ Val 


Dicoaco 

L^l3t?a3C 


Pa t h n 1 n n 1 a 1 
laLiiuiuyiuai 


-3.533 


Deleterious 


Dicpacp 

liyi3C?d3t? 


23 


1 O / O 1 CJOZ-OH- 


H/R 


311 


Hi=/Am 

1 1 1 o/ rA 1 y 


P icoaQP 


KIpi itral 
1 N cLJ ll ai 


—7667 


Deleterious 


P Fcoacp 
L^ locdoc 


24 


r<s77R9Qni7 

1 3/ / 0^3\I 1 / 


G/E 


46 


\jiy/ uiu 


Dicoaco 

L^l3t?Cl3C 


Pat hnlnn!f*al 
I a Ll luiuy luai 


-5.925 


Deleterious 


Dicoacp 
iiyiaC?d3t? 


25 


r<;777^RfiR9 


N/l 


392 


A<5n/llp 

rno 1 1/ 1 1 tJ 


Picoacp 


Mpi itral 
E N cLJ ll a 1 


—7593 


Deleterious 


P Fcoacp 


26 


rs77451368 


G/E 


202 


GIv/Glu 


^I3ca3w 


Pnthninnipnl 

IdLIIVIUUIOCI] 


—5.178 


Deleterious 


DiQPfmp 

^/I3 Wu3w 


27 


r<^77?919n7 

1 o / / OZ 1 / 


Y/r 


304 


1 yi/>^yb 


P icpacp 


KIpi itral 
1 N cLJ ll al 


—8.358 


Deleterious 


P Fcpacp 


28 


rq779P4.nnd 


D/A 


380 


A^n/Ala 


P icoaQP 


KIpi itral 

1 NCLJ ll a 1 


—7593 


Deleterious 


P Fcoacp 






P/L 


391 


Prn/I All 


Ul3t?a3C 


Pat n/^ 1/^^ 1 a 1 


—9.269 


Deleterious 


r^iG Aa C A 


30 


rs7fi7fi?71 


N/S 


370 


An^i/^pr 

rAI lO/ vJCI 


Mpi itral 

1 ^ C7 Li Ll a 1 


Mpi itral 

1 N ^ LJ ll a 1 


—2.128 


Neutral 


Picpacp 


31 


rc;7fi7R?71 R 

1 O /U / DO 1 1 U 


N/T 


370 


AQn/Thr 

rAo 11/ 1 III 


P icoaQP 
L-' locaoc 


KIpi itral 

1 NCLJ ll ai 


—3.062 


Deleterious 


P FCOaCP 


32 


r<i7fi9?8197 

I3/U£^0 


Y/C 


363 


ly 1 / v^ya 


DicoacA 

Ul3CCl3C 


Pa t h /A 1 n n ! a 1 
idLiiv/iuyiLidi 


-8.492 


Deleterious 


DicAacA 

Lyi3C?d3t? 






Y/P 


205 


Ti/r /r*uc 

lyr/ v^ya 


Lf l3t?a3C 


Pat rm 1/^^ 1 f^a 1 
rdLIIUIUy lt<dl 


—7.552 


Deleterious 


r^ic Aac A 


34 


rs76014919 

13/ Uw 1 ^9 1 J 


W/C 


378 


II !>// wy 3 


^I3wa3w 


Pnthninnipnl 

IdLIIVIVUIOCII 


—12.306 


Deleterious 


DiQPfmp 

^ 13 Wu3w 


35 


13/ •JiJU'tUUU 


l/T 


402 


lleThr 


Dicpacp 

L^l3t?Cl3C 


Pa t h n 1 n 1 a 1 
idLIIUIUyiUdl 


—4.363 


Deleterious 


DicAacA 
L^iacdat? 


36 


13 / »J J^O*T J'-r 


S/R 


366 


^Ar/Arn 
oci / /Al y 


Dicpacp 

L^l3t?a3C 


Pathnlnnir^al 
idLIIUIUyiLidl 


—2.806 


Deleterious 


Die AacA 
L^iacdat? 


37 


1 o / ■JOOJOvJO 


N/T 


396 


A^n/Thr 

rAo 11/ 1 III 


P icoacp 


KIpi itral 
I N cLJ ll a 1 


—5.562 


Deleterious 


P Fcpacp 


38 


rs75243000 

13/ ^b^O www 


F/S 


397 


Phe/Ser 

III w/ w wl 


Dicpacp 
^I3^a3w 


Pnthninnipfil 

lCILII\./IUMIwCll 


-4.782 


Deleterious 


Di^pflQP 

l_f 1 o CI O 


39 


rqTd-QR^RRR 

1 o /M-c3 JOU JO 


n/F 


24 


rAo [J/ \J lU 


P ICPaQP 

L-' locdoc 


KIpi itral 

1 N CLJ ll ai 


-3.037 


Deleterious 


P Fcoacp 
VJ locdoc 


40 


rs74752878 

13/^/ «JbO # O 


Y/C 


418 


Tyr/Cys 


Dicpacp 

^I3^cl3w 


Pnthninnipfil 

IdLIIVIVUIOCII 


-8.526 


Deleterious 


Di^pflQP 


41 


rs74598136 


P/L 


401 


Pro/Leu 


Disease 


Pathological 


-8.136 


Deleterious 


Disease 


42 


rs74462743 


G/E 


195 


Gly/Glu 


Disease 


Pathological 


-7767 


Deleterious 


Disease 


43 


rs61 748906 


W/R 


184 


Trp/Arg 


Disease 


Pathological 


-13.028 


Deleterious 


Disease 


44 


rs1141814 


R/W 


48 


Arg/Trp 


Disease 


Pathological 


—6.879 


Deleterious 


Disease 


45 


rs1141811 


T/l 


43 


Thr/lle 


Disease 


Neutral 


-3.515 


Deleterious 


Disease 


46 


rs1141811 


T/R 


43 


Thr/Arg 


Disease 


Pathological 


-2.557 


Deleterious 


Disease 


47 


rs421016 


L/P 


444 


Leu/Pro 


Disease 


Neutral 


-4.995 


Deleterious 


Disease 



The consensus SNPs are shown in bold. 
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Table 4 | Mutant scores from PANTHER. 



S. NO 


rsID 


Alleles 


Position 


AA change 


subPSEC 


^deleterious 


1 


rs121908314 


L/V 


371 


Leu/val 


-3.34802 


0.58614 


2 


rs121908313 


F/L 


251 


Phe/Leu 


-2.59088 


0.39912 


3 


rs121908311 


G/S 


377 


Gly/Ser 


-5.35062 


0.91298 


4 


rs121908310 


V/F 


398 


Val/Phe 


-3.36629 


0.59056 


5 


rs121908306 


C/G 


342 


Cys/Gly 


-3.57193 


0.63921 


6 


rs121908304 


W/C 


312 


Trp/Cys 


-2.59838 


0.40092 


7 


rsl 2 1908303 


F/V 


216 


Phe/Val 


-4.88341 


0.868 


8 


rs121908300 


Y/H 


212 


Tyr/His 


-5.32716 


0.9111 


9 


rs121908295 


P/R 


415 


Pro/Arg 


-4.90228 


0.87015 


10 


rs80356771 


R/C 


463 


Arg/Cys 


-4.45218 


0.81033 


11 


rs80356769 


V/L 


394 


Va I/Leu 


-2.8436 


0.46098 


12 


rs80205046 


P/L 


182 


Pro/Leu 


-6.3153 


0.96495 


13 


rs80116658 


G/D 


265 


Gly/Asp 


-6.00914 


0.95299 


14 


rs79796061 


D/V 


127 


Asp/Val 


-6.29967 


0.96442 


15 


rs79696831 


R/H 


285 


Arg/His 


-4.32962 


0.79078 


16 


rs79653797 


R/Q 


120 


Arg/GIn 


-4.52062 


0.82063 


17 


rs79637617 


P/L 


122 


Pro/Leu 


-4.49053 


0.81616 


18 


rs792 15220 


P/R 


266 


Pro/Arg 


-6.27743 


0.96365 


19 


rs791 85870 


F/L 


417 


Phe/Leu 


-4.16977 


0.7631 


20 


rs78911246 


G/V 


189 


Gly/Val 


-3.38537 


0.59517 


21 


rs78715199 


D/E 


380 


Asp/Glu 


-2.02623 


0.27413 


22 


rs783g6650 


A/V 


309 


Ala/Val 


-4.2769 


0.78192 


23 


rs78198234 


H/R 


311 


His/Arg 


-4.57198 


0.82807 


24 


rs77829017 


G/E 


46 


Gly/Glu 


-5.04065 


0.885 


25 


rs77738682 


N/l 


392 


Asn/lle 


-4.02188 


0.73534 


26 


rs77451368 


G/E 


202 


Gly/Glu 


-1.32995 


0.15842 


27 


rs77321207 


Y/C 


304 


Tyr/Cys 


-6.26737 


0.96329 


28 


rs77284004 


D/A 


380 


Asp/Ala 


-2.36947 


0.34739 


29 


rs76gi0485 


P/L 


391 


Pro/Leu 


-6.12534 


0.95793 


30 


rs76763715 


N/S 


370 


Ans/Ser 


-2.69603 


0.42459 


31 


rs76763715 


N/T 


370 


Asn/Thr 


-1.97735 


0.26451 


32 


rs76228122 


Y/C 


363 


Tyr/Cys 


-4.75749 


0.85289 


33 


rs76026102 


Y/C 


205 


Tyr/Cys 


-5.89294 


0.9475 


34 


rs76014919 


W/C 


378 


Trp/Cys 


-5.31772 


0.91033 


35 


rs75564605 


l/T 


402 


lleThr 


-3.78009 


0.6857 


36 


rs75528494 


S/R 


366 


Ser/Arg 


-2.07688 


0.28432 


37 


rs75385858 


N/T 


396 


Asn/Thr 


-3.61569 


0.64924 


38 


rs75243000 


F/S 


397 


Phe/Ser 


-2.88329 


0.47086 


39 


rs74953658 


D/E 


24 


Asp/Glu 


-4.17446 


0.76395 


40 


rs74752878 


Y/C 


418 


Tyr/Cys 


-6.31864 


0.96506 


41 


rs74598136 


P/L 


401 


Pro/Leu 


-2.14888 


0.2992 


42 


rs74462743 


G/E 


195 


Gly/Glu 


-4.74669 


0.85153 


43 


rs61 748906 


W/R 


184 


Trp/Arg 


-3.5793 


0.64091 


44 


rs1141814 


R/W 


48 


Arg/Trp 


-703366 


0.9826 


45 


rs1141811 


T/l 


43 


Thr/lle 


-4.20869 


0.77007 


46 


rs1141811 


T/R 


43 


Thr/Arg 


-4.05221 


0.7412 


47 


rs421016 


L/P 


444 


Leu/Pro 


-3.43747 


0.60766 



The consensus SNPs are shown in bold. 
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24D— >24E 



120 R—> 120 Q 



371 L ■ 



371 V 



370N— > 370T 



370 N -—> 370 S 



417 F—> 417 L 
216 F—> 216V 
266P— >266R 



265 G — > 265 D ^ 



212 Y — > 212 H 
311H— >311R 



1-5; 

53-10: 
103-155 
156-;06 
207-256 
257J06 
307-356 
357-406 
407-457 
458-497 



qiWA RY 



ARPC Pt SF 
MGPI ^\ xHtBTpLf^ 
LLLKiVl SEBtl 
LKIPI IH ? A 



FVtrLDAi 



RDFL 

GLV\H\VVLDFL 



TK DTF^ K( >P.M FYHLGHFSKF P 



\ V 



309 A — >309V 
312 W — >312C 
463R — > 463 C 
363 Y—> 363 C 
415 P — >415R 
366S— >366R 
418 Y — > 418 C 



\ LN I.S5 ] 0)^ T LTIKDP.WGI L E T S 'GYSIHTYL^ K ? Q 



377 G 



377 S 



RDLGPTIA^ 'THH^^ RLL ULI DQRLLLPmVAia ^XTDPE.A.\KY^ H 
VP.- v.\TLGETHF LFF NTMLFASE.4C\"GSKF\VEQSVRL|gS 



SHSIITMLLYmA GAT rD^A NL.4LNPEGG P;y\\yRXF\ 



378 W — > 378 C 
127 D—> 127 V 
380D— > 380A 



380 D—> 380 E 



G! Q R\ GU ASQK s [ LI A\: a 



O ATVCDSFDPP [ FPAL( iTFSRVE >TR.SGF R> 
L( iPEQKFQia KGFGGA UTDA.A^\LMLAI..SPPAQj\ 
VNllR\ P.\USCT)FSIRt|v T\ AD|rPDDFQl)KSFSL PEf DTfC 
AQ|4P^^SLLASPV TSPTW .KTNG.W NGKGSLKGC PGDFi'H 
AEHKLQFW VVtJvEN iPSAGLLSGVPFQCLGFTPEHQ 



[ELS 



DSPIR 



IHPDGSAV 



DI 




195 G—> 195 E 
43 T — > 43 R 



46G — > 46E 
251 F—> 251 L 

48R — > 48W 
202 G—> 202 E 
205 Y — > 205 C 
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FIGURE 1 I Sets of various mutations identified using various software tools. The respective locations of 44 amino acids responsible for all 47 mutations 
are shown in the sequence (center, colored in bold) and 22 common mutations are highlighted as consensus. 



et al., 2003). Out of 47 deleterious nsSNPs, 8 were found to be 
more than —6 (highly deleterious) and rest were found to be less 
deleterious. The mutant with a greater -Pdeleterious tends to have 
more severe destructions in function. It was found that 32 out of 
47 deleterious nsSNPs scored greater than 3 and rests were below 
the damage threshold (Table 4). 

FUNCTIONAL IMPACT OF MUTATIONS ON PROTEINS 

The functional impact of 47 deleterious nsSNPs in protein of 
GBA was analyzed using PMUT server. Of the 47 nsSNPs, 22 



are classified as pathological, and the remaining were neutral 
(Tables). 

PROTEIN VARIATION EFFECT ANALYSIS 

PROVEAN predicts the effect of the variant on the biological 
function of the protein based on sequence homology. PROVEAN 
scores are classified as "deleterious" if below a certain threshold 
(here -2.5) and "neutral" if above it (Choi et al, 2012). Out of 47 
nsSNPs, 44 were predicted to be "deleterious" and 3 were found 
to be "neutral" (Table 3). 
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PREDICTION OF DISEASE RELATED MUTATIONS BY SNPs&GO 

SNPs&GO is trained and tested with cross-validation proce- 
dures in which similar proteins are placed together as a dataset 
to calculate the LGO score derived from the GO data base. 
All 47 deleterious nsSNPs showed the disease related mutations 
(Tables). 

DISCUSSION 

In the recent years, SNPs have emerged as the new generation 
molecular markers. The harmful SNPs for the GBA gene were 
never been predicted to date in silico. This study was designed to 
understand the genetic variations associated with GBA gene. We 
have predicted the harmful nsSNPs using SIFT, MutPred, nsSNP 
Analyzer, PANTHER, PMUT, PROVEAN, and SNPs&GO state of 
the art computational tools. Among 97 nsSNPs, 47 were found 
to be deleterious with a tolerance index score of <0.05 found 
by SIFT program. Among the 47 deleterious nsSNPs, 46 were 
found to be harmful nsSNPs found by MutPred, 43 were found 
to be disease causing nsSNPs by nsSNP Analyzer tool, 32 are 
highly deleterious found by PANTHER program, 22 are classi- 
fied as pathological mutations by PMUT, 44 were predicted to 
be deleterious by PROVEAN server while all 47 deleterious nsS- 
NPs showed the disease related mutations by SNPs&GO. Also, 
we found that SNPs&GO was most successful of all state of the 
art SNP prediction programs that were used for this compar- 
ative study. In this work, we found 22 nsSNPs that are com- 
mon in all (SIFT, MutPred, nsSNP Analyzer, PANTHER, PMUT, 
PROVEAN, and SNPs&GO) prediction (Figure 1). These sets 
of 22 nsSNPs (F251L, C342G, W312C, P415R, R463C, D127V, 
A309V, G46E, G202E, P391L, Y363C, Y205C, W378C, I402T, 
S366R, F397S, Y418C, P401L, G195E, W184R, R48W, and T43R) 
are possibly the main targeted mutation for the GD (Tables 1-4). 
The previous work has shown that, in 60 patients with types 1 and 
3, the most common Gaucher mutations identified were L444P, 
N370S, and R463C. L444P was the most common mutation in 
GD types 1, 2, and 3 (Latham et al., 1990; Sidransky et al, 1994). 
In our analysis, out of 7 methods, 6 methods (Sift, MutPred, 
PROVEAN, PANTHER, nsSNP Analyzer, and SNPs&GO) shows 
L444P mutation as damaging, 3 methods shows N370S muta- 
tion as damaging and all the 7 methods shows R463C mutation 
as damaging. D409H, A456P, E326K, and V460V mutations were 
also identified in patients with GD (Tsuji et al., 1987; Park et al., 
2002). In our analysis SIFT result shows D409H, A456P and 
E326K mutation is the tolerated mutation. Further studies using 
these mutations will shed light on the genetic understanding of 
this major lysosomal storage disease. 
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